VERSION WITH MARKINGS TO SHOW CHANGES MADE

METHOD FOR FABRICATING AN OLFACTORY RECEPTOR-BASED BIOSENSOR



This application is a continuation-in-part of application 09/057,181 filed April 8, 
1998, the entire disclosure of which is incorporated herein by reference. 



FIELD OF THE INVENTION 


The present invention relates generally to biosensors and, more specifically, to
biosensors which have biomolecules attached to a thin film transducer. 



BACKGROUND OF THE INVENTION 


Chemoreception is an ancient sense system that enables organisms to detect
chemicals in their environment. In humans, odor receptor cells are located in the nose.
The biochemical receptors for the odorants are transmembrane proteins found in the
membrane of receptor cell cilia. Olfactory receptor proteins (ORP) generally have
seven non-intersecting helices. It is believed that conserved residues determine the
orientation of each helix relative to the other helices.

The detection of environmental chemicals is mediated by peripheral olfactory
organs of varied complexity in almost all metazoans. Typically, specialized sensory
neurons initiate perception by detecting ambient molecules, commonly called odors,
that interact with protein receptors in their membranes. ORP on the cilia detect the
odorants entering the nose. The ORP are coded by approximately 1000 genes, which
form the largest gene family in the genome of any species. ORP are members of the
superfamily of proteins having seven transmembrane domains, i.e. the G-protein
coupled receptor (GPCR) superfamily. They have diverse amino acid sequences and are
able to recognize a wide variety of structurally diverse odorants. The amino acid
sequences of ORP are especially variable in several of the transmembrane domains.
This is a possible mechanism for the recognition of a variety of structurally diverse
ligands.

The major path of olfactory transduction is shown in Fig. 1. Binding of the
odor molecules to the receptors may activate a G-protein coupled enzymatic cascade
to generate second messengers. These messengers can open the ion channels on
the membrane of olfactory cells. The open channels may depolarize the
membrane and lead to action potentials and signaling.


There is currently a need to develop sensors which function like an ORP, being
capable of detecting ligands, i.e. certain gas molecules. The goal, then, is to
develop useful sensors for detecting the presence of certain gas molecules according
to the assignment of those gas molecules to certain binding sites of an ORP. It has
been difficult in the past, however, to rapidly determine the secondary and tertiary
molecular structures of ORPs having olfactory receptor binding domains specific to
selected ligands of interest. This is due in part to the complexity of ORP molecules.
As will be understood by those skilled in the art, in an empirical analysis, a
determination of putative binding domains is an extremely labor-intensive endeavor.
It begins with identification and molecular cloning of genes that code for the
receptor protein of interest. These genes are then expressed and the target protein
is isolated and purified. Physical studies such as X-ray diffraction, neutron
diffraction and electron microscopy are conducted to determine 2-D maps and 3-D
structure; site directed mutagenesis is conducted to determine the position of
residues for ligand binding. It would be desirable to provide a method which
eliminates as many of these steps as possible.

SUMMARY OF THE INVENTION

In one aspect, the present invention provides a method for rapidly determining
ORP candidates for use as receptors for preselected odorant molecules.

In another aspect, the present invention provides a method for fabricating a
biosensor which includes a layer of peptides that selectively binds a preselected
odorant molecule.

Accordingly, the present invention provides a method for making a biosensor
capable of detecting a gas molecule, wherein the gas molecule is a ligand capable
of binding to an olfactory receptor protein. The method includes the step of
determining the amino acid sequence of a preselected olfactory receptor protein,
the secondary and tertiary structures of which are not known. Typically this
step will be carried out by choosing an ORP from a database of ORPs which have
been sequenced. In the next step, the amino acid sequence of the ORP selected in
the first step is compared to the sequences of G-protein coupled receptors having
known secondary and tertiary structures. This step will typically be carried out by
accessing a database of G-protein coupled receptors having known primary, secondary
and tertiary structures. Next, based on primary sequence homology, one or more
G-protein coupled receptors are chosen as candidates on which to base the prediction
of the secondary and tertiary structures of the unknown ORP. In the next step, the
secondary and tertiary structures of the unknown ORP are approximated based on
the known structures of the G-protein coupled receptor selected through sequence
homology comparison in the prior steps. The approximated secondary and tertiary
structures of the unknown ORP are then analyzed using conventional modeling
techniques to identify likely binding domains for the ligand of interest. A
polypeptide is then synthesized having the primary sequence of the most likely
binding domain for the ligand. These polypeptides are attached to a transducer.
The resultant biosensor is then tested by exposing it to the target ligand and
determining binding efficiencies. By identifying and testing a number of
polypeptides in this manner, high affinity biosensors can be rapidly fabricated.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 is a diagram illustrating the major pathway of olfactory transduction.

Fig. 2 is a flow chart illustrating the modeling steps of the present invention.

Fig. 3 is an amino acid sequence for OLFD CANFA (P30955).

Fig. 4 is a three dimensional structure showing the simulation results of the
olfactory receptor protein, OLFD CANFA (P30955), docking with trimethylamine,
which is shown as spherical molecular models.

Fig. 5 is a perspective view of a transducer made in accordance with a
preferred embodiment of the present invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS



The detailed embodiment of the present invention is disclosed herein. It 
should be understood, however, that the disclosed embodiment is merely exemplary 
of the invention, which may be embodied in various forms. Therefore, the details 
disclosed herein are not to be interpreted as limiting, but merely as the basis for the
claims and as a basis for teaching one skilled in the art how to make and/or use the 
invention. 

Fig. 2 is a flow chart illustrating the modeling steps of the preferred
embodiment. Referring now to Fig. 2 of the drawings, an olfactory receptor
protein which has been sequenced is selected in step 210. Of course, it may be
desirable in some cases to actually clone, express, isolate and sequence a new ORP; 
however, in most instances an ORP will be chosen from a sequence database having 
the primary amino acid sequence of various ORPs. One preferred database for use 
in the present invention is available on the ExPASy server of the Swiss Institute of 
Bioinformatics. Other similar databases or print sources may be equally suitable. 

Once the ExPASy server has been accessed, the "SWISS PROT and TrEMBL"
database is opened. The ExPASy server is open to the public and may be accessed
via the Internet. Next, using the keyword search features of this database, the key
words "olfactory receptor" may be used to create a subset of sequences of olfactory
receptor proteins. An ORP is then selected, the sequence of which is to be used in
the practice of the invention. The known sequence is displayed along with additional
information on the ORP such as EMBL cross references, length and molecular weight.
The amino acid sequence information is generally subdivided into potential
extracellular, transmembrane and cytoplasmic domains, which are predicted and
provided only for reference. For example, an ORP, OLFD CANFA (P30955), is selected
from the "SWISS PROT and TrEMBL" database. The amino acid sequence is shown in
Fig. 3, and the predicted secondary-structure features of OLFD CANFA (P30955) are
listed in Table 1.



Table 1

Key        Position   Length   Description
Domain     1-25       25       Extracellular (potential)
Transmem   26-49      24       1 (potential)
Domain     50-57      8        Cytoplasmic (potential)
Transmem   58-79      22       2 (potential)
Domain     80-100     21       Extracellular (potential)
Transmem   101-120    20       3 (potential)
Domain     121-139    19       Cytoplasmic (potential)
Transmem   140-158    19       4 (potential)
Domain     159-195    37       Extracellular (potential)
Transmem   196-218    23       5 (potential)
Domain     219-235    11       Cytoplasmic (potential)
Transmem   236-259    24       6 (potential)
Domain     260-271    12       Extracellular (potential)
Transmem   272-291    20       7 (potential)
Domain     292-313    22       Cytoplasmic (potential)
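As an illustrative aid only, the following minimal Python sketch shows one way the entries of Table 1 might be checked for internal consistency before they are used for reference. It assumes that a listed length should equal the last position minus the first position plus one and that consecutive segments should be contiguous; the names TABLE_1, span_length, mismatched_rows and is_contiguous are hypothetical and are not part of any database interface.

```python
# Rows transcribed from Table 1: (key, first residue, last residue, listed length).
TABLE_1 = [
    ("Domain", 1, 25, 25), ("Transmem", 26, 49, 24),
    ("Domain", 50, 57, 8), ("Transmem", 58, 79, 22),
    ("Domain", 80, 100, 21), ("Transmem", 101, 120, 20),
    ("Domain", 121, 139, 19), ("Transmem", 140, 158, 19),
    ("Domain", 159, 195, 37), ("Transmem", 196, 218, 23),
    ("Domain", 219, 235, 11), ("Transmem", 236, 259, 24),
    ("Domain", 260, 271, 12), ("Transmem", 272, 291, 20),
    ("Domain", 292, 313, 22),
]


def span_length(first, last):
    """Number of residues in an inclusive position range."""
    return last - first + 1


def mismatched_rows(rows):
    """Return (first, last, listed, computed) for rows whose listed length differs."""
    return [
        (first, last, listed, span_length(first, last))
        for _, first, last, listed in rows
        if span_length(first, last) != listed
    ]


def is_contiguous(rows):
    """True when every segment starts one residue after the previous one ends."""
    return all(rows[i + 1][1] == rows[i][2] + 1 for i in range(len(rows) - 1))


if __name__ == "__main__":
    print(mismatched_rows(TABLE_1))
    print(is_contiguous(TABLE_1))
```

Under these assumptions the segments are contiguous, and the only row flagged is the cytoplasmic segment 219-235, whose listed length of 11 differs from the 17 residues spanned by its positions.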





In step 220 of Fig. 2, the predicted secondary structure, such as α-helix,
β-sheet, and transmembrane regions, of the ORP under investigation is determined
by using, for example, the "PredictProtein" server of the "BIOcomputing 3D Modeling
Unit Service" (B. Rost: PHD: predicting one-dimensional protein structure by profile
based neural networks. Methods in Enzymology, 266, 525-539, 1996).
The "PredictProtein" server can be accessed through worldwide web sites. The
service of "PredictProtein" includes sequence analysis and structure prediction.
One can submit any protein sequence, and then "PredictProtein" retrieves similar
sequences in the database and predicts aspects of protein structure. The
"PredictProtein" server uses several programs and databases, such as those listed
in Table 2, to predict a protein's structure.



Table 2

Program's Type                Program   Function
Alignment and database        MaxHom    MaxHom is a dynamic multiple sequence alignment
searching methods                       program which finds similar sequences in a database.
Sequence motif                ProSite   ProSite is a database of functional motifs.
searching methods
Sequence motif                ProDom    ProDom is a database of putative protein domains,
searching methods                       searched with BLAST for domains corresponding to
                                        the sequence being investigated.
                              PHDacc    PHDacc predicts per residue solvent accessibility
                                        from multiple sequence alignments.
                              PHDhtm    PHDhtm predicts the location and topology of
                                        transmembrane helices from multiple sequence
                                        alignments.
                              GLOBE     GLOBE predicts the globularity of a protein.
                              TOPITS    TOPITS is a prediction-based threading program that
                                        finds remote structural homologues in the DSSP
                                        database.
                              COILS     COILS finds coiled-coil regions in a protein.
                              EvalSec   EvalSec evaluates secondary structure prediction
                                        accuracy.

In essence, these servers allow the sequence of the ORP to be submitted for
comparison to the sequences of proteins in the PredictProtein database.
PredictProtein retrieves similar sequences and predicts secondary protein structure
based on data for similar sequences. PredictProtein performs and displays the
results of a "PROSITE" motif search, a "ProDom" domain search, and a MAXHOM alignment
header analysis, and provides information regarding the accuracy of the foregoing
analyses. This prediction of secondary structure is performed by PredictProtein
using a system of neural networks.

The MAXHOM program produces a multiple sequence alignment file which
serves as the input for the neural network system. The output of the MAXHOM
analysis includes identification of aligned proteins, percentage of pairwise sequence
identity, percentage of weighted similarity, number of residues aligned, number of
insertions and deletions (indels), number of residues in all indels, length of aligned
sequences and a short description of the aligned proteins. The preferred neural
network for prediction of secondary structure is described in "Prediction of Protein
Structure at Better than 70% Accuracy", J. Mol. Biol., 1993, 232, 584-599, the
entire disclosure of which is incorporated by reference.
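By way of illustration only, the percentage of pairwise sequence identity reported for an alignment can be sketched as follows. This is a minimal sketch under stated assumptions, not the MAXHOM implementation: it assumes two sequences that have already been aligned to equal length with "-" marking indels, and it counts identity only over columns where both sequences have a residue. The function name percent_identity and the example strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of identical residues over aligned (non-gap) columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # columns containing an indel are not compared
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Hypothetical aligned fragments; one substitution and one indel column.
    print(percent_identity("GDLESF-LL", "GDIESFALL"))
```

Other conventions, for example dividing by the length of the shorter sequence, give different percentages, so the denominator used should always be stated alongside the value.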

Prediction of solvent accessibility is also determined (PHDacc) in accordance
with "The Analysis and Prediction of Solvent Accessibility in Protein Families",
Proteins, 1994, 20, 216-226, the entire disclosure of which is incorporated by
reference. The latter prediction provides values for the relative solvent accessibility.
Prediction of helical transmembrane segments of the ORP is performed by the
PHDhtm program. In this manner, the secondary structure (helix, sheet, loop) and
location relative to the membrane (inside, outside, transmembrane) for the ORP
under investigation are predicted with relative accuracy. Most preferably, the
predicted topology for the transmembrane proteins is determined using PHDtopology
and fold recognition is determined by prediction-based threading using PHDthreader.
Again, the secondary structure predictive determinations are verified for accuracy
using EvalSec. All of the computer programs used in the present invention can be
accessed by the public, and their disclosures are incorporated herein by reference.
For example, the primary amino acid sequence of OLFD CANFA (P30955) is
input into the "PredictProtein" server. Since most odorant molecules bind to
transmembrane helices of an ORP, the predicted seven transmembrane helices of
OLFD CANFA (P30955) are listed in Table 3.

Table 3

Number of helix   Sequence                    Position of amino acids
1                 FYALFLAMYVTTILGNLLIIVLIQ    27-50
2                 LHTPMYLFLSNLSFSDLCFSSV      55-76
3                 LTQMYFFLFFGDLESFLLVAMAYD    98-121
4                 CFSLLVLSWVLTMFHAVLHTLLM     141-163
5                 VIFIMGGLILVIPFLLIITSYARIV   197-221
6                 SHLSVVSLFYGTVIGLYL          242-259
7                 MAMMYTVVTPMLNPFIYS          273-290





In Fig. 2, after the predicted seven transmembrane helices have been determined,
a template protein used to predict the approximated tertiary structure of the
transmembrane helices is selected in step 230. This is preferably achieved in the
preferred embodiment using the Swiss-Model 7TM interface program and, preferably,
BLAST (Basic Local Alignment Search Tool, as described in J. Mol. Biol. 215:403-410,
the entire disclosure of which is incorporated herein by reference). To begin, the
complete sequence of the ORP under investigation is input to the BLAST program
through the Swiss-Model interface, and then the BLAST program determines the most
appropriate modeling template to be used in the tertiary structure investigation.
The modeling template will be that protein (of known primary, secondary and tertiary
structures) having the highest primary sequence homology and similar secondary
structure with the ORP to be investigated.

For example, the primary amino acid sequence of the ORP, OLFD CANFA
(P30955), is input through the Swiss-Model interface. Using BLAST, the primary
sequence of OLFD CANFA (P30955) is compared to the sequences of proteins in the
7TM (seven transmembrane) subset of the SWISS-PROT database, since OLFD CANFA
(P30955) also has seven transmembrane helices. Then, a number of BLAST-assisted
templates, as listed in Table 4, are obtained. In Table 4, Neuropeptide Y1 receptor
(P25929) has the largest P(N). That is, Neuropeptide Y1 receptor (P25929) has the
highest primary sequence homology with OLFD CANFA (P30955). Hence,
Neuropeptide Y1 receptor (P25929) is selected to be the modeling template of
OLFD CANFA (P30955).
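The selection rule just described, namely choosing the candidate with the largest value in the P(N) column, can be sketched in a few lines of Python. This is an illustrative sketch only; the values are those given under P(N) in Table 4, candidates without a value are skipped, and the names CANDIDATES and select_template are hypothetical.

```python
# (SWISS-PROT code, template name, value listed under P(N) in Table 4 or None).
CANDIDATES = [
    ("P25929", "Neuropeptide Y1 receptor", 42),
    ("P07550", "Beta-2 adrenergic receptor", 37),
    ("P21452", "Substance-K receptor", 39),
    ("P02699", "Rhodopsin", 41),
    ("P02945", "Bacteriorhodopsin", None),  # listed as not available
]


def select_template(candidates):
    """Return the candidate with the largest listed value, ignoring missing ones."""
    scored = [row for row in candidates if row[2] is not None]
    if not scored:
        raise ValueError("no candidate has a listed value")
    return max(scored, key=lambda row: row[2])


if __name__ == "__main__":
    code, name, value = select_template(CANDIDATES)
    print(code, name, value)
```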

Table 4

SWISS-PROT   Seven helices modeling template              Smallest Poisson Probability
Code                                                      P(N)    N
P25929       Neuropeptide Y1 receptor (Homo sapiens)      42      6.1x10^-2
P07550       Beta-2 adrenergic receptor (Homo sapiens)    37      2.8x10^-1
P21452       Substance-K receptor                         39      7.0x10^-4
             (Neurokinin A receptor)
P02699       Rhodopsin (Bos taurus)                       41      5.1x10^-8
P02945       Bacteriorhodopsin                            *NA     *NA
             (Halobacterium halobium)

*NA: not available.





After the modeling template has been selected, the sequences of the helical
regions are displayed and the sequences of the helices of the ORP under
investigation (as determined in the secondary structure analysis step of the
present invention) are input through the Swiss-Model 7TM interface program in step
240. That is, the helical regions of the template are aligned with the helical regions
of the ORP under investigation. The comparison yields a prediction of the tertiary
structure (3D in space) of the ORP being investigated on an atom-by-atom basis.
The tertiary structure of the ORP under investigation is preferably output as a
file containing the three coordinates of each atom in the ORP. For example, a lengthy
list of the three coordinates of each atom in OLFD CANFA (P30955) was obtained.
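The text does not state the format of the coordinate file. As a sketch only, and on the assumption that the file follows the fixed-column Protein Data Bank (PDB) text format in which the x, y and z coordinates of an ATOM record occupy columns 31-38, 39-46 and 47-54, the three coordinates of each atom could be read as follows. The helper names parse_atom_coordinates and format_atom, and the sample atom, are hypothetical.

```python
def parse_atom_coordinates(pdb_text):
    """Return (atom name, x, y, z) for each ATOM/HETATM record in PDB-format text."""
    atoms = []
    for line in pdb_text.splitlines():
        if line.startswith(("ATOM", "HETATM")):
            name = line[12:16].strip()   # columns 13-16: atom name
            x = float(line[30:38])       # columns 31-38: x coordinate
            y = float(line[38:46])       # columns 39-46: y coordinate
            z = float(line[46:54])       # columns 47-54: z coordinate
            atoms.append((name, x, y, z))
    return atoms


def format_atom(serial, name, residue, chain, resseq, x, y, z):
    """Build a minimal ATOM record with the coordinate columns in the right place."""
    return (
        f"ATOM  {serial:5d} {name:<4s} {residue:>3s} {chain}{resseq:4d}    "
        f"{x:8.3f}{y:8.3f}{z:8.3f}"
    )


if __name__ == "__main__":
    # Hypothetical alpha-carbon of a phenylalanine at position 27.
    record = format_atom(1, "CA", "PHE", "A", 27, 1.5, -2.25, 10.0)
    print(parse_atom_coordinates(record))
```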

The preferred protocol taken into consideration for step 240 includes energy
minimization and the like, as described in: ProMod and Swiss-Model: Internet-based
Tools for Automated Comparative Protein Modeling, Biochem. Soc. Trans. v. 24, 274,
1996; Large-Scale Comparative Protein Modeling, Proteome Research: New Frontiers
in Functional Genomics, 177, 1997; Swiss-Model and the Swiss-PDBviewer: an
Environment for Comparative Protein Modeling, Electrophoresis, v. 18, 2714, 1997;
Automated Modeling of the Transmembrane Region of G-Protein Coupled Receptor by
Swiss-Model, Receptors and Channels, v. 4, 161, 1996; Protein Modeling by E-mail,
Bio/Technology, v. 13, 658, 1995, the disclosures of which are incorporated by
reference.

(The preferred modeling software programs which can be used in the present
invention have a high degree of sophistication. For example, ProMod, which is
under the SWISS-MODEL Repository of the ExPASy Molecular Biology Server, applies a
protein modelling tool which requires similarities with experimentally determined
protein structures. ProMod is a knowledge-based approach to predictive structure
determination. It requires at least one known 3D structure of a related protein and
good quality sequence alignments; the degree of sequence identity affects the
accuracy of the predictive structure. In ProMod, there is a superposition of related
3D structures. A multiple alignment with the sequence under investigation is made.
A framework for the new sequence is made and any missing loops are rebuilt. The
backbone of the structure is completed and corrected if required. Side chains are
corrected and rebuilt. The resultant structure is verified and packing is checked.
The structure is then refined by energy minimization and molecular dynamics
considerations.)

The tertiary structures of the helices of the ORP under investigation are thus
determined in step 240 and may be viewed stereoscopically using a program such as
Insight II, a commercial program formerly provided by Molecular Simulations Inc.
and now provided by Accelrys Inc., Swiss PDB-viewer or the like. Next, in
step 250, a ligand, i.e. a gas, is selected. A number of assays may be used to
determine high general binding affinities of various ligands for the ORP under
investigation. The molecular structures of the ligand and the ORP under
investigation are then input to the Insight II program, i.e. the tertiary or 3D
structures of the ORP helices and the ligand are input. Next, in step 260, the most
probable geometrical binding domains of the ORP under investigation for the ligand
are determined, preferably using the Global Range Molecular Modeling program
(GRAMM) by geometric recognition algorithms. As will be understood by those skilled
in the art, GRAMM is a program for protein docking, and it treats the ORP and the
ligand as rigid bodies. Since GRAMM utilizes geometric recognition algorithms to
determine the most probable geometrical binding domains of a protein for a ligand,
no specific information about the binding sites is required. It performs a
six-dimensional search through the relative translations and rotations of
molecules. It takes an empirical approach to smoothing the intermolecular energy
function by changing the range of atom-atom potentials. It allows the user to
locate the area of the global minimum of intermolecular energy for structures of
different accuracy.
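The idea of a rigid-body search over relative translations and rotations can be illustrated with a deliberately small toy model. The sketch below is not GRAMM and does not reproduce its algorithm or scoring; it assumes atoms placed on an integer grid, samples only quarter-turn rotations about one axis instead of the full three rotational degrees of freedom, and uses a made-up score that rewards surface contacts and penalizes overlaps. All names and the pocket geometry are hypothetical.

```python
from itertools import product

# The six face-sharing neighbours of a grid point.
NEIGHBOURS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def rotate_z(point, quarter_turns):
    """Rotate a grid point about the z axis by a multiple of 90 degrees."""
    x, y, z = point
    for _ in range(quarter_turns % 4):
        x, y = -y, x
    return (x, y, z)


def score(receptor, placed_ligand):
    """Contacts between the rigid bodies minus a heavy penalty per overlap."""
    clashes = len(receptor & placed_ligand)
    contacts = sum(
        1
        for (x, y, z) in placed_ligand
        for dx, dy, dz in NEIGHBOURS
        if (x + dx, y + dy, z + dz) in receptor
    )
    return contacts - 10 * clashes


def dock(receptor, ligand, search_range=3):
    """Exhaustively scan rotations and translations; return (score, turns, shift)."""
    best = None
    shifts = range(-search_range, search_range + 1)
    for turns in range(4):
        rotated = [rotate_z(p, turns) for p in ligand]
        for dx, dy, dz in product(shifts, repeat=3):
            placed = {(x + dx, y + dy, z + dz) for x, y, z in rotated}
            value = score(receptor, placed)
            if best is None or value > best[0]:
                best = (value, turns, (dx, dy, dz))
    return best


if __name__ == "__main__":
    # Hypothetical U-shaped pocket and a two-point rigid ligand.
    pocket = {(-1, 0, 0), (1, 0, 0), (0, -1, 0), (-1, 1, 0), (1, 1, 0)}
    rod = [(0, 0, 0), (0, 1, 0)]
    print(dock(pocket, rod))
```

In this toy case the exhaustive scan places the rod inside the pocket, which is the pose with the most contacts and no overlap. A practical docking program replaces the exhaustive scan and the crude score with far more efficient and physically motivated methods.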

Then in step 270, the structures of the ORP and the ligand are allowed to relax.
That is, the structures of the ORP and the ligand are treated as flexible. Hence,
the bond stretching, valence angle bending, torsion, van der Waals force, and
electrostatic force of both the ORP and the ligand are taken into consideration. The
Affinity Docking program, an embedded program of Insight II, is preferably used to
calculate the energy distribution and reaction forces between the ligand and the
geometrical binding domains of the ORP, as predicted by GRAMM, by molecular
mechanics calculations using an energy minimization algorithm. The most probable
overall binding domains are thus determined, and the user can read out the sequence
of the binding domains by moving the mouse over each amino acid of the binding
domains.
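Two of the energy terms named above, the van der Waals and electrostatic interactions between atoms of the ligand and atoms of the ORP, are commonly modelled with a Lennard-Jones 12-6 potential and Coulomb's law. The sketch below is a minimal illustration under that assumption; the text does not state which force field or parameters the Affinity Docking program uses, so the single epsilon and sigma shared by all atom pairs, the function names and the units (kcal/mol, angstroms, elementary charges) are all hypothetical choices.

```python
import math

# Coulomb constant in kcal*angstrom/(mol*e^2), a conventional molecular mechanics value.
COULOMB_CONSTANT = 332.06


def lennard_jones(r, epsilon, sigma):
    """12-6 van der Waals energy for two atoms separated by distance r."""
    ratio6 = (sigma / r) ** 6
    return 4.0 * epsilon * (ratio6 * ratio6 - ratio6)


def coulomb(r, q1, q2):
    """Electrostatic energy of two point charges separated by distance r."""
    return COULOMB_CONSTANT * q1 * q2 / r


def nonbonded_energy(atoms_a, atoms_b, epsilon, sigma):
    """Sum both terms over all pairs; each atom is (x, y, z, charge)."""
    total = 0.0
    for xa, ya, za, qa in atoms_a:
        for xb, yb, zb, qb in atoms_b:
            r = math.dist((xa, ya, za), (xb, yb, zb))
            total += lennard_jones(r, epsilon, sigma) + coulomb(r, qa, qb)
    return total


if __name__ == "__main__":
    # Hypothetical single-atom "ligand" and "receptor" 4 angstroms apart.
    ligand = [(0.0, 0.0, 0.0, 0.2)]
    receptor = [(4.0, 0.0, 0.0, -0.3)]
    print(nonbonded_energy(ligand, receptor, epsilon=0.1, sigma=3.5))
```

An energy minimization algorithm would then adjust the atomic coordinates to lower the sum of such terms together with the bonded terms (bond stretching, angle bending and torsion), which are omitted here for brevity.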

For example, the most probable binding domains, as shown in Fig. 4, of
OLFD CANFA (P30955) for trimethylamine are predicted. The trimethylamine
molecules are shown as spherical molecular models, and OLFD CANFA (P30955)
is shown as a cartoon structure. The eight most probable binding domains of
OLFD CANFA (P30955) for trimethylamine are located in transmembrane 1,
transmembrane 3, and transmembrane 5.

Peptides are then synthesized corresponding to these most probable binding
domains using conventional synthesis technologies. The polypeptides are then
applied to the surface of a transducer, preferably one fabricated using thin film
(semiconductor) techniques as will be known to those skilled in the art. Briefly,
with reference to Fig. 5, transducers 510 coated with polypeptide layer 520 are
on biosensor 500. Transducer 510 is preferably a piezoelectric quartz crystal-based
device. A change will occur if a ligand binds to the polypeptide layer, resulting in
a measurable change in the quartz crystal frequency, allowing detection of ligand
binding.
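The relation between bound mass and frequency change for a quartz crystal is not given in the text. A commonly used approximation for thin, rigid, evenly distributed layers is the Sauerbrey equation, and the sketch below applies it as an assumption only. The 10 MHz fundamental frequency and the mass loading in the example are hypothetical, since neither is stated for transducer 510; the density and shear modulus are standard values for AT-cut quartz.

```python
import math

QUARTZ_DENSITY = 2.648           # g/cm^3
QUARTZ_SHEAR_MODULUS = 2.947e11  # g/(cm*s^2), AT-cut quartz


def sauerbrey_shift(f0_hz, mass_per_area_g_cm2):
    """Frequency change in Hz for a thin rigid mass load (Sauerbrey approximation)."""
    return (
        -2.0 * f0_hz ** 2 * mass_per_area_g_cm2
        / math.sqrt(QUARTZ_DENSITY * QUARTZ_SHEAR_MODULUS)
    )


if __name__ == "__main__":
    # Hypothetical 10 MHz crystal loaded with 1 microgram per square centimetre.
    print(round(sauerbrey_shift(10e6, 1e-6), 1))
```

Under this approximation, added mass lowers the resonant frequency, and the shift scales with the square of the fundamental frequency. Gas-phase measurements on soft peptide layers may deviate from it.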



The success and efficiency of the transducer can be determined, including by 
comparing the sensor's response to the ligand and other molecules. 

For example, peptides synthesized according to the most probable binding
domains of OLFD CANFA (P30955) for trimethylamine are peptides B1, B2, and B3.
The amino acid sequences of the peptides B1, B2, and B3 are DPDQRDC,
GDLESFC, and CFFLFFGD. These peptides all have, or have had added, a cysteine
(symbol C) residue at one terminus. The transducers of a biosensor have gold
electrodes, and the -SH functional group of the cysteine can react with the gold
electrodes directly in an organic solution to form a chemical bond between them.
Hence, a simple way to attach these peptides is to dip the surface of the gold
electrode on the piezoelectric quartz in the peptide solution at room temperature
for a period of time. In this way, these peptides can be attached to the surfaces
of the transducers. Then, after these peptides have been attached to the
transducers of a biosensor, the biosensor is used to detect various gases such as
trimethylamine, dimethylamine, ammonia, acetone, formic acid, ethanol, and
formaldehyde. The experimental results for the peptides B1, B2, B3, and Pb2 are
listed in Table 6, wherein the peptide Pb2 is not designed according to the most
probable binding domains of the ORP, OLFD CANFA (P30955).
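As an illustration of how a synthesized peptide can be traced back to the predicted helices, the following sketch searches the helix sequences of Table 3 for the peptide sequences with the terminal cysteine removed. Treating the terminal cysteine as the added residue is an assumption made for this example, and the names HELICES and locate are hypothetical.

```python
# Helix number -> (first residue position, sequence), transcribed from Table 3.
HELICES = {
    1: (27, "FYALFLAMYVTTILGNLLIIVLIQ"),
    2: (55, "LHTPMYLFLSNLSFSDLCFSSV"),
    3: (98, "LTQMYFFLFFGDLESFLLVAMAYD"),
    4: (141, "CFSLLVLSWVLTMFHAVLHTLLM"),
    5: (197, "VIFIMGGLILVIPFLLIITSYARIV"),
    6: (242, "SHLSVVSLFYGTVIGLYL"),
    7: (273, "MAMMYTVVTPMLNPFIYS"),
}


def locate(fragment):
    """Return (helix number, first position, last position) or None if absent."""
    for number, (start, sequence) in HELICES.items():
        index = sequence.find(fragment)
        if index != -1:
            first = start + index
            return number, first, first + len(fragment) - 1
    return None


if __name__ == "__main__":
    # Peptide cores with the terminal cysteine removed (assumed to be the added residue).
    for label, core in (("B1", "DPDQRD"), ("B2", "GDLESF"), ("B3", "FFLFFGD")):
        print(label, locate(core))
```

With the Table 3 sequences, the B2 and B3 cores both fall within helix 3, while the B1 core does not occur in any of the seven listed helix sequences.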

Table 6

Gas detected                 Frequency changes (Hz)
                             B1      B2      B3      Pb2
Trimethylamine (5.86 ppm)    5696    488     687     221
Dimethylamine (3.78 ppm)     3851    578     721     589
Ammonia (4.86 ppm)           1022    206     209     345
Acetone (7.21 ppm)           13      9       9       31
Formic acid (1.33 ppm)       161     56      85      97
Ethanol (4.68 ppm)           -5      6               16
Formaldehyde (6.54 ppm)      -25     -22     -21     19





Peptide sequence of B1: DPDQRDC
Peptide sequence of B2: GDLESFC 
Peptide sequence of B3: CFFLFFGD 
Peptide sequence of Pb2: LFLSNLSFSDLCA 



In Table 6, the numbers shown in each column under each peptide are
changes in the quartz crystal vibration frequency. Hence, the larger the absolute
value of the number, the greater the sensitivity for the gas detected. For
the desired target gas, trimethylamine, the peptides B1, B2, and B3 all show a much
more sensitive response than the peptide Pb2 designed by other methods.
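The comparison can be made concrete by taking the absolute frequency change as the sensitivity and dividing each designed peptide's value by that of Pb2 for the same gas. The sketch below does this for three rows of Table 6; the names TABLE_6, sensitivity and ratio_to_reference are hypothetical, and the ratio is offered only as one simple way to summarize the data.

```python
# Frequency changes in Hz, transcribed from three rows of Table 6.
TABLE_6 = {
    "Trimethylamine": {"B1": 5696, "B2": 488, "B3": 687, "Pb2": 221},
    "Dimethylamine": {"B1": 3851, "B2": 578, "B3": 721, "Pb2": 589},
    "Ammonia": {"B1": 1022, "B2": 206, "B3": 209, "Pb2": 345},
}


def sensitivity(gas, peptide):
    """Sensitivity taken as the absolute value of the frequency change."""
    return abs(TABLE_6[gas][peptide])


def ratio_to_reference(gas, peptide, reference="Pb2"):
    """How many times larger the peptide's response is than the reference's."""
    return sensitivity(gas, peptide) / sensitivity(gas, reference)


if __name__ == "__main__":
    for peptide in ("B1", "B2", "B3"):
        print(peptide, round(ratio_to_reference("Trimethylamine", peptide), 1))
```

For trimethylamine, all three ratios exceed one, with B1 giving by far the largest response relative to Pb2.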

It will be apparent to those skilled in the art that various modifications and 
variations can be made to the structure of the present invention without departing 
from the scope or spirit of the invention. In view of the foregoing, it is intended that 
the present invention cover modifications and variations of this invention provided 
they fall within the scope of the following claims and their equivalents. 
CLAIMS



What is claimed is: 

1. A method of making a biosensor capable of detecting a molecule, wherein
the molecule is a ligand capable of binding an olfactory receptor protein,
said method comprising the steps of:

(a) determining the amino acid sequence of a preselected olfactory
receptor protein, the secondary and tertiary structures of said olfactory
receptor protein being unknown;

(b) comparing the amino acid sequence of said preselected olfactory
receptor protein to known amino acid sequences of transmembrane
proteins having known secondary and tertiary structures, said known
amino acid sequences of said transmembrane proteins being selected
from the group consisting of G-protein coupled receptors;

(c) selecting at least one of said known amino acid sequences of
transmembrane proteins by determining which of said known amino
acid sequences has the highest degree of sequence homology with the
amino acid sequence of said preselected olfactory receptor protein;

(d) using said selected sequence to approximate the secondary and
tertiary structures of said preselected olfactory receptor protein;

(e) using said approximated secondary and tertiary structures of said
olfactory receptor protein to identify a likely binding domain of said
olfactory receptor protein for said ligand;

(f) synthesizing a polypeptide having the primary structure of said likely
binding domain; and

(g) attaching said synthesized polypeptide to the surface of a transducer.



2. A method of making a biosensor capable of detecting a gas molecule, the 
method comprising: 

obtaining an amino acid sequence of an olfactory receptor protein (ORP); 

using the amino acid sequence of the ORP to compute a predictive secondary 
structure of the ORP and predict transmembrane fragments of the ORP; 

comparing the amino acid sequence of the ORP with G-protein coupled 
receptors (GPCRs) by Basic Local Alignment Search Tool (BLAST) to find a template 
protein having the highest primary sequence homology with the ORP, wherein the 
primary, secondary and tertiary structures of the GPCRs are known and the 
secondary structures of the GPCRs are similar to the predictive secondary structure 
of the ORP; 

using the tertiary structure of the template protein as an initial tertiary 
structure for the ORP to compute a predictive tertiary structure of the ORP by energy 
minimization and molecular dynamics, thereby obtaining the three-dimensional 
coordinates of each atom of the ORP; 

treating the ORP as a rigid body, according to the three-dimensional coordinates 
of each atom of the ORP, to calculate geometrical binding domains of the ORP with a gas 
molecule by geometric recognition algorithms; 

relaxing the structure of the ORP to further calculate, based on the geometrical 
binding domains, the most probable binding domains of the ORP with the gas 
molecule; 

synthesizing peptides according to the primary amino acid sequence of the 
likely binding domains; and 

attaching the synthesized peptides to a surface of a transducer having a gold 
electrode on a piezoelectric quartz, to serve as a receptor of a biosensor. 

3. The method of claim 2, wherein a molecule containing a thiol functional group 
is added to a terminus of the synthesized peptides when neither terminus of the 
synthesized peptides has a thiol functional group. 

4. The method of claim 3, wherein attaching the synthesized peptides to the 
surface of the transducer comprises: 

dissolving the synthesized peptides in a solvent to form a peptide solution; and 
dipping the gold electrode in the peptide solution at room temperature. 

5. A method of finding candidates for peptides to be coated on an electronic 
nose for detecting a gas, the method comprising: 

obtaining an amino acid sequence of an olfactory receptor protein (ORP); 

using the amino acid sequence of the ORP to compute a predictive secondary 
structure of the ORP and predict transmembrane fragments of the ORP; 

comparing the amino acid sequence of the ORP with G-protein coupled 
receptors (GPCRs) by Basic Local Alignment Search Tool (BLAST) to find a template 
protein having the highest primary sequence homology with the ORP, wherein the 
primary, secondary and tertiary structures of the GPCRs are known and the 
secondary structures of the GPCRs are similar to the predictive secondary structure 
of the ORP; 

using the tertiary structure of the template protein as an initial tertiary 
structure for the ORP to compute a predictive tertiary structure of the ORP by energy 
minimization and molecular dynamics, thereby obtaining the three-dimensional 
coordinates of each atom of the ORP; 

treating the ORP as a rigid body, according to the three-dimensional coordinates 
of each atom of the ORP, to calculate geometrical binding domains of the ORP with a gas 
molecule by geometric recognition algorithms; and 

relaxing the structure of the ORP to further calculate, based on the geometrical 
binding domains, the most probable binding domains of the ORP with the gas 
molecule, whereby amino acid sequences of the most probable binding domains are 
candidates for peptides to be coated on an electronic nose. 
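
The rigid-body step recited in claims 2 and 5 can be pictured with a minimal grid sketch. It is an illustration under stated assumptions, not the geometric recognition algorithm actually used: molecules are reduced to sets of occupied cubic cells, only translations of the ligand are scanned (a real search also scans rotations and is typically accelerated with Fourier correlation), and the clash penalty, function names and toy shapes are all hypothetical. Each empty cell beside the receptor is weighted by the number of receptor cells it touches, so a pocket scores higher than a flat face. 

```python
from itertools import product

# Offsets to the six face-sharing neighbours of a grid cell.
NEIGHBORS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]
CLASH_PENALTY = -15  # hypothetical penalty per ligand cell overlapping the receptor


def contact_field(receptor):
    """Map each empty cell touching the receptor to its count of receptor neighbours."""
    field = {}
    for x, y, z in receptor:
        for dx, dy, dz in NEIGHBORS:
            cell = (x + dx, y + dy, z + dz)
            if cell not in receptor:
                field[cell] = field.get(cell, 0) + 1
    return field


def placement_score(receptor, field, ligand, shift):
    """Score the ligand translated by shift: contacts are rewarded, clashes penalized."""
    total = 0
    for x, y, z in ligand:
        cell = (x + shift[0], y + shift[1], z + shift[2])
        total += CLASH_PENALTY if cell in receptor else field.get(cell, 0)
    return total


def best_translation(receptor, ligand, size):
    """Exhaustively scan integer translations in a size**3 box and keep the best one."""
    field = contact_field(receptor)
    shifts = product(range(size), repeat=3)
    best = max(shifts, key=lambda s: placement_score(receptor, field, ligand, s))
    return best, placement_score(receptor, field, ligand, best)


# Toy receptor: a 6 x 6 x 3 block with a 2 x 2 x 1 pocket cut into its top face.
pocket = {(x, y, 6) for x in (6, 7) for y in (6, 7)}
receptor = {(x, y, z) for x in range(4, 10) for y in range(4, 10)
            for z in range(4, 7)} - pocket
# Toy ligand: a flat 2 x 2 x 1 tile defined at the origin.
ligand = {(x, y, 0) for x in (0, 1) for y in (0, 1)}

shift, score = best_translation(receptor, ligand, size=16)
print(shift, score)
```

With these toy shapes the best-scoring translation places the ligand in the pocket. In the claimed method, the receptor cells facing the best-scoring placements would correspond to the geometrical binding domains that are then refined by relaxing the structure. 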
ABSTRACT OF THE DISCLOSURE 



A method for rapidly designing a biosensor which binds a preselected ligand to 
a layer of polypeptides. The polypeptide layer is made of relatively short molecules 
representative of a binding domain of an Olfactory Receptor Protein. The method 
uses a series of predictive structure determinations to obtain the sequence of the 
polypeptides applied to a transducer surface. 